AN EXTENSION WHICH IS RELATIVELY TWOFOLD MIXING 
BUT NOT THREEFOLD MIXING 



THIERRY DE LA RUE 

Abstract. We give an example of a dynamical system which is mixing relative 
to one of its factors, but for which relative mixing of order three does not hold. 



1. Factors, extensions and relative mixing 

1.1. Factors, extensions and Rokhlin cocycle. We are interested in dynamical 
systems (X, ^,T), where T is an ergodic automorphism of the Lebesgue space 
(X, srf ', ji). We will often designate such a system by simply the symbol T. A factor 
of T is a sub-er-algebra 3t of s£ such that 3tf = T^ 1 ,^ . 

The canonical example of a system with factor is given by the skew product, 
constructed from a dynamical system (Xh , s^h-, /J-h, Tjj) (called the base of the skew 
product) and a measurable map x i — ► S x from Xh to the group of automorphisms 
of some Lebesgue space (Y,S§, v) (such a map is called a Rokhlin cocycle). The 
transformation is defined on the product space {Xjj x Y, s/jj <g> 38, hh ® v ) by 

T(x,y) = (T H x,S x y). 

In this context, the sub-cr-algebra s^h ® {Y, 0} is clearly a factor of T . 

Since the work of Abramov and Rokhlin [1], this kind of construction is known 
to be the general model for a system with factor: If is a factor of T, then there 
exists an isomorphism tp between T and a skew product T constructed as above, 
with ip(Jf?) = s^h <8 {Y, 0}. In such a situation, we say that T is an extension of 
T H - 

1.2. Mixing relative to a factor. To understand precisely the way a factor is 
embedded in the dynamical system, one is led to study the behaviour of the system 
relative to the factor; to this end, relative properties are defined which are general- 
izations of absolute properties of dynamical systems. For example, one can define 
weak-mixing relative to a factor (see e.g. [2]), or the property of being a K-system 
relative to a factor [4], 

We are interested in this work in the property of being mixing relative to a factor. 

Definition 1.1. Let 34? be a factor of the system (X, , fi,T). T is said Jf- 
relatively mixing if 

(1) VA,Besf, fi(AnT- k B\jr) - f i(A\Jf)iJ l (T- k B\Jf) proba » 0. 

k — >+oo 

As for the absolute property of mixing, it is possible to define mixing relative to 
a factor of any order n > 2. The property described by (1) corresponds to relative 
mixing of order 2 (twofold relative mixing) ; for relative mixing of order 3 (threefold 
relative mixing), (1) should be replaced by 
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(2) VA,B,Ce< 

H (A n T~'B n T- k C\Jf) - /j,{A\Jf?)ij,(T- j B\Jf)(j,(T- k C\jr) pr ° ba > 0. 

3,k-j— >+oo 

Whether (absolute) twofold mixing implies threefold mixing is a well-known open 
problem in ergodic theory. The main goal of this work is to show that as far as 
relative mixing is concerned, twofold does not necessarily imply threefold. 

Theorem 1.1. We can construct a dynamical system (X, srf ', ji, T) with a factor 
such that T is Jf? -relatively twofold mixing but not -relatively threefold mixing. 

2. An extension which is relatively twofold mixing but not 
relatively threefold mixing. 

2.1. The base. The dynamical system announced in Theorem 1.1 is constructed 
as a skew product, whose base (Xh, , ^h,Th) is obtained as follows: Take 
Xh '■= [0, 1[ equipped with the Lebesgue measure fin on the Borcl cr-algebra s^h- 
The transformation Th can be viewed as a triadic version of the Von Neumann- 
Kakutani transformation; we describe now its construction by the cutting and stack- 
ing method (see Figure 1). 

We begin by splitting Xh into three subintervals of length 1/3; we set B\ :— 
[0, l/3[. The transformation Th translates B\ onto Th B\ := [1/3, 2/3[, and trans- 
lates T H Bi onto T H 2 Bi := [2/3, 1[. At this first step, T H is not yet defined on 
Th 2 B\. In general, after the n-th step of the construction, Xh has been split 
into 3™ intervals of same length: B n ,Tn B n , . . . , Th 3 _1 B n . These intervals form 
a so-called Rokhlin tower with base B n and height 3". Such a tower is usually 
represented by putting the intervals one on top the other, the transformation Th 
mapping each point to the one exactly above. At this step, the transformation is 
not yet defined on Th 3 1 B n . Step n+ 1 starts by chopping the base B n into three 
subintervals of the same length, the first of which is denoted by B n+ \. The n-th 
Rokhlin tower is thus split into three columns, which are stacked together to get 
the n + 1st tower. This amounts to mapping Th 3 1 B n+ i onto the second piece 
of B n by a translation, and T# 2x3 _1 B n+ \ onto the third piece of B n . Th is now 

ofl+l_i 

defined everywhere except on Th Bn+i- 

The iteration of this construction for all n > 1 defines Th everywhere on Xh- 
The transformation obtained in this way preserves Lebesgue measure, and it is well 
known that the dynamical system is ergodic. 

2.2. The extension. In order to construct the extension of Th, we will now define 
a Rokhlin cocycle x i — > S x from Xh into the group of automorphisms of (Y, SS, v), 
where Y := { — 1, 1} N , SB is the Borel cr-algebra of Y, and v is the probability 
measure on Y which makes the coordinates independent and identically distributed, 
with v{\)k = 1) = v{yk = —1) = 1/2 for each k > 0. 

If y = (j/fe)feeN e Y and < i < j, we denote by y\\ the finite word yiy l+ i ■ ■ ■ yj. 
For each n > 0, we call n-block a word of length 2™ on the alphabet {—1, 1}. Lc 
first n-block of y is thus y\g _1 . If w\ = yo . . . j/2"-i and u>2 = zq . . . Z2»-i arc two 
n-blocks, we denote by w\ u>2 the (n + l)-block obtained by the concatenation of 
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Figure 1 . Construction of Tjj by cutting and stacking 



W\ and W2, and wi .XW2 the n-block defined by the termwise product of w\ and 

wiw 2 ■= yo---y2"-i2o..-^2«-i, and wi.xw 2 ■= (yo X z ) • • • (s/2»-i x^»-i). 

For each n > 1, we now define a transformation r n of Y which will be useful for 
the construction of the Rokhlin cocycle. This transformation only affects the first 
n-block of y : if this first n-block is W1W2 (where w\ and W2 are (n — l)-blocks), 
then the first n-block of r„ y is W2 (wi .x u^)- Coordinates with indices at least 2™ 
of T n y remain unchanged. The two following properties of r n are easy to verify: 

• r„ preserves the probability v, 

• r n 3 = Id r . 

For every x G Xjj, we denote by n(x) the smaller integer n > 1 such that a; does 
not belong to the top of tower n. In other words, n(x) is the integer n > 1 such 
that a; is at the step rt of the construction of Th- We then set 

Sx ■= T„( K ) o r„( x )_x o • • • o n. 

From the properties of r„, it is easy to derive that S x is always an automorphism 
of (Y, v). From now on, we denote by T the skew product on Xh x Y equipped 
with the product measure \in <g) v defined by 

T(x,y) := (T H x,S x y), 

Let be the factor of T given by the er-algebra s$h ® {Y, 0}. 

2.3. Relative twofold mixing which is not threefold. Let n > 1, and (x,y) € 
Xh x Y with a; in the base B n of the n-th tower. For each k > 0, we denote 
by y^ the point of Y defined by T k (x,y) — (Tn k x,y^). From the construction 
of the Rokhlin cocycle, while T# fe x has not reached the top of tower n, y is only 
transformed by some Tj with j < n. Therefore, in the sequence y^ , y^ , . . . , ?/ 3 _1 ' 
(corresponding to the climb of x upward tower n) , only the first n-block is modified 
and these modifications do not depend on the coordinates of y with indices at least 
2". 
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We are particularly interested in the sequence y$ Ho • ■ - Ho °f coordinates 
with null index, which we see as a random colouring of the climb of x upward tower 
n. From the preceding remark, this colouring only depends on the first n-block of 
y. Therefore there exists some map j n : { — 1, l} 2 — > { — 1, l} 3 such that 

(0) (1) (3"-l) / i2 n -l\ 

Vo Vo •■•Vo = 7n[y\ )■ 

Lemma 2.1. Assume further that x lies in the base of the first or second column 
in tower n (i.e. x <G B n+ i or x € Th 3 B n+ \). Then 

y (3 "' = r n+1 y. 

Proof. It is easily checked by induction on n, using the fact that t„ 3 = Idy. □ 

Lemma 2.1 gives a relation between 7„ and 7„+i. Indeed, if x lies in B n+ i, 
the climbing of x upward tower (n + 1) can be seen as three successive climbings 
of x upward tower n, whose colourings are given by y^ = y, ?/ 3 ' = r n+ i y and 
^(2x3") _ Tn+l 2 y, j t f n ows that ^he colouring of the first climbing of x upward 
tower n is coded by the first n-block y\ _1 of y, the colouring of the second climbing 

i2™ +1 — 1 

of x upward tower n is coded by the second n-block y| 2 „ , and the colouring 
of the third climbing of x upward tower n is coded by their termwise product 
y|g _1 .x y\\ n _1 . Hence, if w is an (n + l)-block which is the concatenation of 
the two n-blocks w\ w 2 , we have 

(3) 7n+lO) = ln{w 1 )^n(.W 2 )ln(w 1 .X W 2 ) . 

Therefore, the sequence (-f n )n>i of coding maps is entirely determined by 

71 : a b i — ► ab(a x b) 

and the recurrence relation (3). The proof of the following lemma follows easily: 

Lemma 2.2. Lef w\ and w 2 be two n-blocks. Then 

-f n (w 1 .xw 2 ) = 7„(wi) .x 7„(w 2 )- 

From the preceding observations, we can deduce some properties of the condi- 
tional law of the colouring process knowing x. 

Proposition 2.1. Let x € Xh o,nd n > 1. Let j > &e i/ie smallest integer 
such that Th 3 x e B n+ \. We denote by C™, and i/ie respective random 
colouring of the three successive climbings of x upward tower n. The conditional 
law of (Ci,C 2 ,C^) knowing Jf? satisfies the following properties : 

• C™, C 2 and are identically distributed; 

• Ci,C 2 and C$ are pairwise independent; 

• C% = Cf .xC£. 

Proof. Since Jrf? is a T-invariant cr-algebra, we can always assume to simplify the 
notations that j = (i.e. x G B n+ i). It follows from what has been seen be- 
fore that C™, C% and C3 are given respectively by 7 n (y|g J n (y\ 2 n _1 ) an d 

7n(y|o — 1 - x y ||„ But these three n-blocks y | g — X , 1 and _1 - x y| 2 ™ 

are identically distributed and pairwise independent. Therefore, the three colour- 
ings are themselves identically distributed and pairwise independent. The equality 
C™ = C™ .x C 2 is a straightforward consequence of Lemma 2.2. □ 
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It follows easily from Proposition 2.1 that the property (1) characterizing twofold 
mixing relatively to the factor ffl is true when A and B are measurable with respect 
to a finite number of coordinates of the colouring process (yo ° T k )kez- Indeed, in 
such a case we can find an integer n (depending on x) such that A and B are 
measurable with respect to one of the blocks Cf (i = 1,2 or 3) defined in the 
previous proposition. Then, as soon as k > 3™, A and T~ h B are given by two 
blocks C™ (for some m > n) which are independent under the conditional law 
knowing Jif. 

Then, (1) extends by density to every sets A and B measurable with respect to 
the cr-algebra generated by and the colouring process (yo T k )ke%- But this 
cr-algebra is easily shown to be the whole g?H®£$, since knowing x and (yo°T k )ke% 
we can always recover each coordinate y n , n G N. (Details are left to the reader.) 
It follows that the system is ^-relatively twofold mixing. 

However, the system is not Jtf '-relatively threefold mixing: If A, B and C are 
defined by 

A = B = C := {(x,y) : y = l}, 

we have 

n{A\JfT) = v(B\J4?) = n(C\Jf) = 1/2, 
but for each n > 1 and each x in the first column of tower n, 

^(AnT- 3 "BnT- 2x3 "c|jr) = 1/4. 

3. Comments and questions 

Joinings. The question of the existence of a system which is twofold but not three- 
fold mixing is strongly connected with the following question: Does there exist a 
joining of three copies of some weakly mixing, zero-entropy dynamical system which 
is pairwise independent but which is not the product measure? In [3], Lemahczyk, 
Mcntzen and Nakada answer positively to the relative version of this problem: They 
construct a relatively weakly-mixing extension T of an ergodic rotation Th, and 
a 3-joining A of T identifying the three copies of Th, which is pairwise but not 
threewise independent relative to Th- However their construction does not seem to 
come from an extension which is twofold but not threefold relatively mixing. 

Mixing in the base? The example which we have presented above can easily be 
modified in order to make the dynamical system in the base weakly mixing. Indeed, 
we can replace the triadic Von Neumann-Kakutani by Chacon's transformation, 
whose construction is similar with the only following difference: In each step of 
the construction we add a supplementary spacer interval between second and third 
column. The sequence (h n ) of the heights of the successive towers thus satisfies 
hn+i = 3h n + 1. It is well known that Chacon's transformation is weakly, but not 
strongly, mixing. Defining S x in a similar way when x does not lie in some spacer, 
and S x := Id in any spacer, we get the same conclusion concerning twofold but 
not threefold relative mixing. The lack of threefold relative mixing is checked by 
considering, for x in the first column of tower n, /i(A n T~ hrl B n T~^- 2hn+1 ^ C\Jff) . 

Then it is natural to look for a similar result with the dynamical system in 
the base strongly mixing. Indeed, it is easily shown that if Th is mixing and if 
T is J^-relatively mixing, then T is mixing. This would give some hope to get a 
transformation that is twofold but not threefold mixing. However, there seem to 
be serious obstacles to achieving the same kind of construction with a mixing base. 
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On the definition of relative mixing. In the present work we have used the definition 
of relative mixing defined by the convergence to zero in probability (or equivalently 
in L 1 ) of the sequence 

(4) n(AnT- k B\Jf) - fi(A\J{?)n(T- k B\Jf). 

An other possible definition of relative mixing is used by Rahe in his work on factors 
of Markov processes [5]: In this paper, a process (x k ) ke % (with x k = x o T k ) is 
said J^f -relatively mixing if, for all A and B measurable with respect of a finite 
number of cooordinates of the process (x k ), the convergence of (4) to zero holds 
almost surely. 

The difference between these two definitions is discussed in a recent work of 
Rudolph [6] , where it is shown that there exists a system which is relatively mixing 
with respect to one of his factors in the L 1 sense, but not in the almost-sure sense. 
Rudolph also shows that checking almost-sure convergence of (4) to zero for a dense 
class of subsets A and B (as in Rahe's definition) implies that the same convergence 
holds for every A and B. 

It is not difficult to see that, for the example we presented here, the same results 
concerning twofold and threefold relative mixing hold if we replace L 1 convergence 
by almost-sure convergence. 
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